AITopics | dense layer

EUGens: Efficient, Unified and General Dense Layers

Neural Information Processing SystemsJun-13-2026, 23:47:01 GMT

Efficient neural networks are essential for scaling machine learning models to real-time applications and resource-constrained environments. Fully-connected feedforward layers (FFLs) introduce computation and parameter count bottlenecks within neural network architectures. To address this challenge, in this work, we propose a new class of dense layers that generalize standard fully-connected feedforward layers, $\textbf{E}$fficient, $\textbf{U}$nified and $\textbf{Gen}$eral dense layers (EUGens). EUGens leverage random features to approximate standard FFLs and go beyond them by incorporating a direct dependence on the input norms in their computations. The proposed layers unify existing efficient FFL extensions and improve efficiency by reducing inference complexity from quadratic to linear time.

artificial intelligence, machine learning, proceedings, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

51f15efdd170e6043fa02a74882f0470-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 22:00:30 GMT

arxiv preprint arxiv, large language model, machine learning, (18 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)

Add feedback

218344619d8fb95d504ccfa11804073f-Supplemental.pdf

Neural Information Processing SystemsApr-25-2026, 02:07:54 GMT

agent, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.14)

Industry: Transportation (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Searching for Efficient Linear Layers over a Continuous Space of Structured Matrices

Neural Information Processing SystemsMar-17-2026, 21:33:04 GMT

Dense linear layers are the dominant computational bottleneck in large neural networks, presenting a critical need for more efficient alternatives. Previous efforts to develop alternatives have focused on a small number of hand-crafted structured matrices, and have neglected to investigate whether these structures can surpass dense layers in terms of compute-optimal scaling laws when both the model size and training examples are optimally allocated. In this work, we present a unifying framework that enables searching among all linear operators expressible via an Einstein summation. This framework encompasses many previously proposed structures, such as low-rank, Kronecker, Tensor-Train, and Monarch, along with many novel structures. We develop a taxonomy of all such operators based on their computational and algebraic properties, which provides insights into their scaling laws. Combining these insights with empirical evaluation, we identify a subset of structures that achieve equal or better performance than dense layers as a function of training compute. To further improve their compute efficiency, we develop a natural extension of these performant structures that convert them into a sparse Mixture-of-Experts layer. The resulting layer significantly outperforms dense layers in compute-optimal training efficiency for GPT-2 language models.

inductive learning, machine learning, proceedings, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.59)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.59)

Add feedback

c919a2b5ec1de69f2629f9119676e336-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 01:56:50 GMT

artificial intelligence, machine learning, representation, (18 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Pelee: A Real-Time Object Detection System on Mobile Devices

Jun Wang, Tanner Bohn, Charles Ling

Neural Information Processing SystemsFeb-13-2026, 21:00:34 GMT

Neural Information Processing Systems http://nips.cc/

accuracy, computational cost, peleenet, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
North America > Canada > Ontario > Middlesex County > London (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Deep, complex, invertible networks for inversion of transmission effects in multimode optical fibres

Oisín Moran, Piergiorgio Caramazza, Daniele Faccio, Roderick Murray-Smith

Neural Information Processing SystemsFeb-12-2026, 07:26:09 GMT

The experimental data is used to train complex-weighted models with a range of regularisation approaches.

artificial intelligence, fibre, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > Scotland (0.05)
North America > Canada > Quebec > Montreal (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.97)

Add feedback

e4191d610537305de1d294adb121b513-Supplemental.pdf

Neural Information Processing SystemsFeb-10-2026, 20:46:58 GMT

eigenvector, experiment, upstream, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

CMMA: Benchmarking Multi-Affection Detection in Chinese Multi-Modal Conversations

Neural Information Processing SystemsFeb-10-2026, 15:02:02 GMT

Human communication has a multi-modal and multi-affect nature.

machine learning, natural language, sentiment, (21 more...)

Neural Information Processing Systems

Country:

Asia > China > Hong Kong (0.05)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)
(6 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Communications > Social Media (0.69)

Add feedback

A Experiment Details and Complete Results

Neural Information Processing SystemsFeb-9-2026, 17:15:49 GMT

A.2 Model Architectures In this section we describe in detail each of the model architectures we use in our experiments. Our small ConvNet consists of the following layers: A convolutional layer with 32 kernels of size 3 3 and ReLU activation. A max pooling layer with pool size 2 2. A flatten layer. For inputs of shape 32 32 3, this model has 21,697 parameters. Our large ConvNet model consists of the following layers: A convolutional layer with 32 kernels of size 3 3, padding, and ReLU activation.

activation, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.34)

Technology: